An Exercise in Fault-Containment: Self-Stabilizing Leader Election

نویسندگان

  • Sukumar Ghosh
  • Arobinda Gupta
چکیده

Self-stabilizing algorithms are designed to guarantee convergence to some desired stable state from arbitrary initial states arising out of an arbitrarily large number of faults. However, in a well-designed system, the simultaneous occurrence of a large number of faults is rare. It is therefore desirable to design algorithms that are not only self-stabilizing, but also have the ability to recover very fast from a bounded number of faults. As an illustration, we present a simple self-stabilizing leader election protocol that recovers in 0( 1) time from a state with a single transient fault on oriented rings, Only the faulty node and its two neighbors change their state during convergence to a stable state. Thus, the effect of a single fault is tightly contained around the fault. The technique for transforming a self-stabilizing algorithm into its fault-contained version is simple and general, and can be applied to other problems as well that satisfy certain properties.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fault - Containment in Self - Stabilizing Distributed Systems

Self-stabilizing systems can automatically recover from arbitrary transient faults, and changes in the environment of the system, without any external intervention. However, in existing distributed self-stabilizing protocols, the performance of recovery is not linked to the severity of the fault. Recovery from failure at even a single component of the system may take a long time and aaect the o...

متن کامل

Robust Stabilizing Leader Election

In this paper, we mix two well-known approaches of the fault-tolerance: robustness and stabilization. Robustnessis the aptitude of an algorithm to withstand permanent failures such as process crashes. The stabilization is a generaltechnique to design algorithms tolerating transient failures. Using these two approaches, we propose algorithms thattolerate both transient and crash fail...

متن کامل

A Universal Self - Stabilizing Mutual Exclusion Algorithm ∗ 3 Q @ nM 5 < ! 9 - EgBg

A distributed system consists of a set of processes and a set of communication links. A distributed system is said to be self-stabilizing if it converges to a correct system state from arbitrary initial system states. A self-stabilizing system is considered to be a fault tolerant system , since it tolerates any kind and any finite number of transient failures. The leader election and mutual exc...

متن کامل

A Universal Self - Stabilizing Mutual Exclusion Algorithm ∗ 3 Q @ nM 5 < ! 9 - EgBg 3

A distributed system consists of a set of processes and a set of communication links. A distributed system is said to be self-stabilizing if it converges to a correct system state from arbitrary initial system states. A self-stabilizing system is considered to be a fault tolerant system , since it tolerates any kind and any finite number of transient failures. The leader election and mutual exc...

متن کامل

Stabilizing leader election in partial synchronous systems with crash failures

This article deals with stabilization and fault-tolerance. We consider two types of stabilization: the selfand the pseudostabilization. Our goal is to implement the selfand/or pseudostabilizing leader election in systems with process crashes, weak reliability, and synchrony assumptions. We try to propose, when it is possible, communication-efficient implementations. Our approach allows to obtai...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Inf. Process. Lett.

دوره 59  شماره 

صفحات  -

تاریخ انتشار 1996